Hello there!
I am trying to adapt the official Google Colab for language generation to tensorflow and everything seems to work wonderfully by simply appending TF to most of the huggingface function calls (TFAutoModel, etc)
Unfortunately, this strategy fails at the training step:
from transformers import TFTrainer, TFTrainingArguments
import tensorflow as tf
training_args = TFTrainingArguments(
"test-clm",
evaluation_strategy = "epoch",
learning_rate=2e-5)
trainer = TFTrainer(
model=model,
args = training_args,
train_dataset=lm_datasets[0:1000],
eval_dataset=lm_datasets[1000:])
trainer.train()
self.num_train_examples = self.train_dataset.cardinality().numpy()
AttributeError: 'dict' object has no attribute 'cardinality'
I have absolutely no idea what this cardinality is. Do you know what the issue can be?
Thanks!